Brno University of Technology at TRECVid 2010 SIN, CCD
نویسندگان
چکیده
1. The runs differ in the types of visual features used. All runs use several bag-of-word representations fed to separate linear SVMs and the SVMs were fused by logistic regression. *F_A_Brno_resource_4: Only single best visual features (on the training set) are used – dense image sampling with rgb-SIFT. * F_A_Brno_basic_3: This run uses dense sampling and Harris-Laplace detector in combination with SIFT and rgb-sift descriptors. * F_A_Brno_spacetime_1: This run extends F_A_Brno_color_2 by adding space-time visual features STIP and HESSTIP. 2. Combining multiple types of visual features improves results significantly. F_A_Brno_color_2 achieve more than twice better results than F_A_Brno_resource_4. The space-time visual features did not improve results. 3. Combining multiple types of visual features is important. Linear SVM is inferior to non-linear SVM in the context of semantic indexing. 1. Two runs submitted, but with similar settings; the difference is only in amount of processed test data (40% and 60%) • brno.m.*.l3sl2: SURF, bag-of-words (visual codebook: 2k size, 4 nearest neighbors used in soft-assignment), inverted file index, geometry (homography) based image similarity metric 2. What if any significant differences (in terms of what measures) did you find among the runs? • only one setting used – no differences 3. Based on the results, can you estimate the relative contribution of each component of your system/approach to its effectiveness? • slow search in reference dataset due to unsuitable configuration of used visual codebook 4. Overall, what did you learn about runs/approaches and the research question(s) that motivated them? • change the way of describing the video content – frame based (or key-frame based) approach is not sufficient
منابع مشابه
JOANNEUM RESEARCH and Vienna University of Technology at TRECVID 2010
We participated in two tasks: semantic indexing (SIN) and instance search (INS).
متن کاملAT&T Research at TRECVID 2010
AT&T participated in two tasks at TRECVID 2010: contentbased copy detection (CCD) and instance-based search (INS). The CCD system developed for TRECVID 2009 was enhanced for efficiency and scale and was augmented by audio features [1]. As a pilot task, participation in INS was meant to evaluate a number of algorithms traditionally used for search in a fully automated setting. In this paper, we ...
متن کاملAT & T Research at TRECVID 2011
AT&T participated in two tasks at TRECVID 2011: contentbased copy detection (CCD) and instance-based search (INS). The CCD system developed for TRECVID 2010 was enhanced for speed and augmented with an additional picturein-picture detector and alternative audio features [1]. As a pilot task, participation in INS evaluated object-level contentbased copy detection and created a basis for integer-...
متن کاملParticipation at TRECVID 2011 Semantic Indexing & Content-based Copy Detection Tasks
Semantic Indexing Task (SIN) Run No. Run ID Run Description infMAP (%) 1 F A IUPR-DFKI 1 Fisher Kernel + SVMs 2.86 2 F A IUPR-DFKI 2 Color Correlogram + SVMs 5.38 3 F A IUPR-DFKI 3 Fisher Kernel fused with Color Correlograms + SVMs 5.0 4 F A IUPR-DFKI 4 Fisher Kernel + kNN 0.71 Content-based Copy Detection (CCD) Run No. Run ID Run Description Opt.NDCR 1 *iupr-dfki.fsift F-SIFT+BoW+HE+EWGC 0.776...
متن کاملNotebook Paper Brno University of Technology at TRECVid 2013 Interactive Surveillance Event Detection
In the paper, we describe our experiments in the interactive surveillance event detection pilot (SED) of the 2013 TRECVid evaluation [13]. Our approach inherits functionality of the Surveillance Network Augmented by Retrieval Event Detection (SUNAR-ED) system, which is an information retrieval based wide area (video) surveillance system being developed at Faculty of Information Technology, Brno...
متن کامل